ACO-based document clustering method

نویسنده

  • Lukasz Machnik
چکیده

Ant systems are flexible to implement and give possibility to scale because they are based on multi agent cooperation. The aim of this publication is to show the universal character of that solution and potentiality in implementing it in wide areas of applications. The increase of demand for effective methods of large document collections management is a sufficient stimulus to place the research on the new application of ant based systems in the area of text document processing. Hitherto existing far generated ant based clustering methods are presented and briefly described at the beginning of that article. Next, the author defines the ACO (Ant Colony Optimization) metaheuristic, which was the basis of the method developed by him. Presentation of the details of the ant based documents clustering method is the main part of publication. 1. State of research on ant-based clustering methods Ant based algorithms are assigned to the group of multiagent systems. In such systems single agent (artificial ant) behavior is inspired by behavior of real ants. Ant based clustering and sorting algorithm Ant based clustering and sorting algorithm was first introduced by Deneubourg in 1990 [1]. As its name implies, two types of natural ant behavior are modeled by this algorithm. Firstly, clustering, where ants gather items to form heaps. An example for this is the clustering of dead corpses (cemetery formation) observed in the species of Pheidole pallidula. Secondly, sorting, where ants discriminate different kinds of items and spatially arrange them according to their properties. This type of activity can be observed in nests of Leptothorax unifasciatus, where larvae are arranged dependent on their sizes. In the Deneubourg’s model ants are modeled by simple agents, which randomly move in their environment, which is a square grid with periodic boundary conditions. Data items that are scattered within this environment can be picked E-mail address: [email protected] Pobrane z czasopisma Annales AIInformatica http://ai.annales.umcs.pl Data: 08/12/2017 07:22:39

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

ACO documents clustering - details of processing and results of experiments

Ant algorithms, particularly Ant Colony Optimization met-heuristic, are a universal and flexible solution. In this publication the author presents the implementation of that technique in the documents clustering area – the new documents clustering method. The aim of this document is to present the details of the ACO documents clustering method, potential ways to optimize its processing and deta...

متن کامل

A Novel Ant-Based Clustering Approach for Document Clustering

Recently, much research has been proposed using nature inspired algorithms to perform complex machine learning tasks. Ant Colony Optimization (ACO) is one such algorithm based on swarm intelligence and is derived from a model inspired by the collective foraging behavior of ants. Taking advantage of the ACO in traits such as self-organization and robustness, this paper proposes a novel document ...

متن کامل

Classification with cluster-based Bayesian multi-nets using Ant Colony Optimisation

Bayesian Multi-net (BMN) classifiers consist of several local models, one for each data subset, to model asymmetric, more consistent dependency relationships among variables in each subset. This paper extends an earlier work of ours and proposes several contributions to the field of clustering-based BMN classifiers, using Ant Colony Optimization (ACO). First, we introduce a new medoidbased meth...

متن کامل

خوشه‌بندی اسناد مبتنی بر آنتولوژی و رویکرد فازی

Data mining, also known as knowledge discovery in database, is the process to discover unknown knowledge from a large amount of data. Text mining is to apply data mining techniques to extract knowledge from unstructured text. Text clustering is one of important techniques of text mining, which is the unsupervised classification of similar documents into different groups. The most important step...

متن کامل

Study of Text Mining Using Hybrid Agglomerative Clustering With ACO Algorithms

Textual document clustering technique was introduced in the area of text mining. The two important main goals in document clustering are achieving high performance or efficiency and obtaining highly accurate data clusters that are closed to their natural classes or textual document cluster quality To enhance this work, we are going to propose a new hybrid clustering algorithm using Agglomerativ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • Annales UMCS, Informatica

دوره 3  شماره 

صفحات  -

تاریخ انتشار 2005